
Coresets via Bilevel Optimization for Continual Learning and Streaming

Borsos, Zalán, Mutný, Mojmír, Krause, Andreas

arXiv.org Machine Learning

Coresets are small data summaries that are sufficient for model training. They can be maintained online, enabling efficient handling of large data streams under resource constraints. However, existing constructions are limited to simple models such as k-means and logistic regression. In this work, we propose a novel coreset construction via cardinality-constrained bilevel optimization. We show how our framework can efficiently generate coresets for deep neural networks, and demonstrate its empirical benefits in continual learning and in streaming settings.
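To make the bilevel idea concrete, here is a toy numpy sketch (illustrative only, not the paper's implementation): the coreset is grown by greedy forward selection under a cardinality constraint, the inner problem is a closed-form ridge regression trained on the candidate coreset, and the outer objective is the loss of that inner solution on the full dataset. All names and the linear-regression setup are assumptions chosen so the sketch stays runnable.

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 5))
w_true = rng.normal(size=5)
y = X @ w_true + 0.1 * rng.normal(size=200)

def inner_solve(idx, lam=1e-3):
    # Inner problem: ridge regression trained only on the coreset indices.
    Xs, ys = X[idx], y[idx]
    d = Xs.shape[1]
    return np.linalg.solve(Xs.T @ Xs + lam * np.eye(d), Xs.T @ ys)

def outer_loss(theta):
    # Outer objective: loss of the inner solution on the full dataset.
    return np.mean((X @ theta - y) ** 2)

def greedy_coreset(m):
    # Cardinality-constrained selection, relaxed to greedy forward selection:
    # at each step, add the point whose inclusion most reduces the outer loss.
    selected, remaining = [], list(range(len(X)))
    best_loss = np.inf
    for _ in range(m):
        best, best_loss = None, np.inf
        for i in remaining:
            loss = outer_loss(inner_solve(selected + [i]))
            if loss < best_loss:
                best, best_loss = i, loss
        selected.append(best)
        remaining.remove(best)
    return selected, best_loss

core, loss_core = greedy_coreset(10)
print("coreset:", core, "outer loss:", loss_core)
```

With a 5-dimensional linear model, a handful of well-chosen points already pins down the inner solution, so the 10-point coreset's outer loss approaches that of training on all 200 points.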


Learning Discrete Structures for Graph Neural Networks

Franceschi, Luca, Niepert, Mathias, Pontil, Massimiliano, He, Xiao

arXiv.org Machine Learning

Relational learning is concerned with methods that can not only leverage the attributes of data points but also the relationships among them. Diagnosing a patient, for example, depends not only on the patient's vitals and demographic information but also on the same information about their relatives, the hospitals they have visited, and so on. Relational learning therefore does not assume independence between data points but models their dependencies explicitly. Graphs are a natural way to represent relational information, and many machine learning algorithms leverage graph structure. Graph neural networks (GNNs) (Scarselli et al., 2009) are one such class of algorithms, able to incorporate sparse and discrete dependency structures between data points. While a graph structure is available in some domains, in others it has to be inferred or constructed. A possible approach is to first build a k-nearest neighbor (kNN) graph based on some measure of similarity between the data points. This is a common strategy used by several learning methods, such as LLE (Roweis & Saul, 2000) and Isomap (Tenenbaum et al., 2000).
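As an illustration of the kNN-graph construction the abstract mentions (a minimal sketch, assuming Euclidean distance as the similarity measure, not the paper's method), the adjacency matrix can be built directly from pairwise distances:

```python
import numpy as np

def knn_graph(X, k):
    # Pairwise squared Euclidean distances via the expansion
    # ||a - b||^2 = ||a||^2 + ||b||^2 - 2 a.b
    sq = np.sum(X ** 2, axis=1)
    d2 = sq[:, None] + sq[None, :] - 2 * X @ X.T
    np.fill_diagonal(d2, np.inf)          # exclude self-loops
    nbrs = np.argsort(d2, axis=1)[:, :k]  # k nearest neighbors per row
    A = np.zeros((len(X), len(X)), dtype=int)
    rows = np.repeat(np.arange(len(X)), k)
    A[rows, nbrs.ravel()] = 1             # directed edge i -> neighbor
    return A

# Two tight clusters: each point's nearest neighbor is its cluster partner.
X = np.array([[0.0], [0.1], [5.0], [5.1]])
print(knn_graph(X, k=1))
```

Note the resulting adjacency matrix is directed (each row has exactly k ones) and is often symmetrized, e.g. with `A = np.maximum(A, A.T)`, before being handed to a graph-based learner.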


Bilevel Programming for Hyperparameter Optimization and Meta-Learning

Franceschi, Luca, Frasconi, Paolo, Salzo, Saverio, Grazzi, Riccardo, Pontil, Massimiliano

arXiv.org Machine Learning

We introduce a framework based on bilevel programming that unifies gradient-based hyperparameter optimization and meta-learning. We show that an approximate version of the bilevel problem can be solved by explicitly taking into account the optimization dynamics of the inner objective. Depending on the specific setting, the outer variables take either the meaning of hyperparameters in a supervised learning problem or parameters of a meta-learner. We provide sufficient conditions under which solutions of the approximate problem converge to those of the exact problem. We instantiate our approach for meta-learning in the case of deep learning, where representation layers are treated as hyperparameters shared across a set of training episodes. In experiments, we confirm our theoretical findings, present encouraging results for few-shot learning, and contrast the bilevel approach against classical approaches for learning-to-learn.
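The core mechanism, differentiating the outer objective through the unrolled inner optimization dynamics, can be sketched in one dimension (a hypothetical toy problem, not the paper's setting): the inner loss is a regularized quadratic with hyperparameter `lam`, and the hypergradient of a validation-style outer loss is propagated through T gradient-descent steps by the chain rule.

```python
import numpy as np

# Inner objective:  L_in(theta, lam) = (theta - 1)^2 + lam * theta^2
# Outer objective:  L_out(theta)     = (theta - 0.5)^2
# We unroll T steps of inner gradient descent and carry d(theta)/d(lam)
# alongside theta, which yields the exact gradient of the unrolled map.

def unrolled_hypergradient(lam, theta0=0.0, eta=0.1, T=100):
    theta, dtheta_dlam = theta0, 0.0
    for _ in range(T):
        grad = 2 * (theta - 1) + 2 * lam * theta   # d L_in / d theta
        # Chain rule through the update theta <- theta - eta * grad:
        # d grad / d lam = (2 + 2*lam) * dtheta_dlam + 2 * theta
        dtheta_dlam = (1 - eta * (2 + 2 * lam)) * dtheta_dlam - eta * 2 * theta
        theta = theta - eta * grad
    outer = (theta - 0.5) ** 2
    hypergrad = 2 * (theta - 0.5) * dtheta_dlam    # d L_out / d lam
    return theta, outer, hypergrad

theta_T, outer, hg = unrolled_hypergradient(lam=0.3)
print("theta_T:", theta_T, "outer loss:", outer, "hypergradient:", hg)
```

Here the inner problem has the fixed point theta* = 1/(1 + lam), so at lam = 0.3 the outer loss still decreases as lam grows (the hypergradient is negative), and gradient descent on lam moves it toward the outer optimum at lam = 1. The same recursion is what reverse- or forward-mode differentiation computes automatically in the general case.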


Far-HO: A Bilevel Programming Package for Hyperparameter Optimization and Meta-Learning

Franceschi, Luca, Grazzi, Riccardo, Pontil, Massimiliano, Salzo, Saverio, Frasconi, Paolo

arXiv.org Machine Learning

In (Franceschi et al., 2018) we proposed a unified mathematical framework, grounded in bilevel programming, that encompasses gradient-based hyperparameter optimization and meta-learning. We formulated an approximate version of the problem, where the inner objective is solved iteratively, and gave sufficient conditions ensuring convergence to the exact problem. In this work we show how to optimize learning rates, automatically weight the losses of individual examples, and learn hyper-representations with Far-HO, a software package based on the popular deep learning framework TensorFlow that makes it possible to seamlessly tackle both HO and ML problems.


A Bridge Between Hyperparameter Optimization and Learning-to-learn

Franceschi, Luca, Donini, Michele, Frasconi, Paolo, Pontil, Massimiliano

arXiv.org Machine Learning

We consider a class of nested optimization problems involving inner and outer objectives. We observe that by explicitly taking into account the optimization dynamics of the inner objective, it is possible to derive a general framework that unifies gradient-based hyperparameter optimization and meta-learning (or learning-to-learn). Depending on the specific setting, the variables of the outer objective take either the meaning of hyperparameters in a supervised learning problem or parameters of a meta-learner. We show that some recently proposed methods in the latter setting can be instantiated in our framework and tackled with the same gradient-based algorithms. Finally, we discuss possible design patterns for learning-to-learn and present encouraging preliminary experiments for few-shot learning.